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Abstract 


The present chapter deals with the study of general family of factor-type 
estimators for estimating population mean of stratified population in the presence of non- 
response whenever information on an auxiliary variable are available. The proposed 
family includes separate ratio, product, dual to ratio and usual sample mean estimators as 
its particular cases and exhibits some nice properties as regards to locate the optimum 
estimator belonging to the family. Choice of appropriate estimator in the family in order 
to get a desired level of accuracy even if non-response is high, is also discussed. The 


empirical study has been carried out in support of the results. 


Keywords: Factor-type estimators, Stratified population, Non-response, Optimum 
estimator, Empirical study. 


1. Introduction 

In sampling theory the use of suitable auxiliary information results in 
considerable reduction in variance of the estimator. For this reason, many authors used 
the auxiliary information at the estimation stage. Cochran (1940) was the first who used 
the auxiliary information at the estimation stage in estimating the population parameters. 
He proposed the ratio estimator to estimate the population mean or total of a character 
under study. Hansen et. al. (1953) suggested the difference estimator which was 
subsequently modified to give the linear regression estimator for the population mean or 
its total. Murthy (1964) have studied the product estimator to estimate the population 


mean or total when the character under study and the auxiliary character are negatively 
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correlated. These estimators can be used more efficiently than the mean per unit 
estimator. 

There are several authors who have suggested estimators using some known 
population parameters of an auxiliary variable. Upadhyaya and Singh (1999) have 
suggested the class of estimators in simple random sampling. Kadilar and Cingi (2003) 
and Shabbir and Gupta (2005) extended these estimators for the stratified random 
sampling. Singh et. al. (2008) suggested class of estimators using power transformation 
based on the estimators developed by Kadilar and Cingi (2003). Kadilar and Cingi (2005) 
and Shabbir and Gupta (2006) have suggested new ratio estimators in stratified sampling 
to improve the efficiency of the estimators. Koyuncu and Kadilar (2008) have proposed 
families of estimators for estimating population mean in stratified random sampling by 
considering the estimators proposed in Searls (1964) and Khoshnevisan et. al. (2007). 
Singh and Vishwakarma (2008) have suggested a family of estimators using 
transformation in the stratified random sampling. Recently, Koyuncu and Kadilar (2009) 
have proposed a general family of estimators, which uses the information of two auxiliary 
variables in the stratified random sampling to estimate the population mean of the 


variable under study. 


The works which have been mentioned above are based on the assumption that 
both the study and auxiliary variables are free from any kind of non-sampling error. But, 
in practice, however the problem of non-response often arises in sample surveys. In such 
situations while single survey variable is under investigation, the problem of estimating 


population mean using sub-sampling scheme was first considered by Hansen and Hurwitz 


(1946). If we have incomplete information on study variable X, and complete 


information on auxiliary variable X,, in other words if the study variable is affected by 


non-response error but the auxiliary variable is free from non-response. Then utilizing the 
Hansen-Hurwitz (1946) technique of sub-sampling of the non-respondents, the 
conventional ratio and product estimators in the presence of non-response are 


respectively given by 
Tig = (Taee 3 )X (1.1) 
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and Te. = (TeX. (1.2) 


The purpose of the present chapter is to suggest separate-type estimators in 
stratified population for estimating population mean using the concept of sub-sampling of 
non-respondents in the presence of non-response in study variable in the population. In 
this context, the information on an auxiliary characteristic closely related to the study 


variable, has been utilized assuming that it is free from non-response. 


In order to suggest separate-type estimators, we have made use of Factor-Type 
Estimators (FTE) proposed by Singh and Shukla (1987). FTE define a class of estimators 
involving usual sample mean estimator, usual ratio and product estimators and some 
other estimators existing in literature. This class of estimators exhibits some nice 


properties which have been discussed in subsequent sections. 
2. Sampling Strategy and Estimation Procedure 

Let us consider a population consisting of N units divided into x strata. Let the 
size of i” stratum is N ;, (i =1,2,...,4) and we decide to select a sample of size n from the 


entire population in such a way that n, units are selected from the 7” stratum. Thus, we 
k 

have )n, =n. Let the non-response occurs in each stratum. Then using Hansen and 
i=l 

Hurwitz (1946) procedure we select a sample of size m, units out of n,,non-respondent 


units in the i” stratum with the help of simple random sampling without replacement 


(SRSWOR) scheme such that n,. =L,m,, L, 21 and the information are observed on all 


the m, units by interview method. 


The Hansen-Hurwitz estimator of population mean X o: of study variable X, for 


the i” stratum will be 


«Ny Xo FN XOmi : 
1); > (i 
Nn. 


L 


10%) ) (2.1) 
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where xo and Xom are the sample means based on n,, respondent units and m, non- 


respondent units respectively in the i” stratum for the study variable. 


Obviously 7;, is an unbiased estimator of X0:. Combining the estimators over all 


the strata we get the estimator of population mean Xo of study variable X,, given by 


k 
Tou = Dy PT o (2.2) 


which is an unbiased estimator of Xo. Now, we define the estimator of population mean 


X\ of auxiliary variable X , as 
k 
Ty = Pik (2.3) 


where x1: is the sample mean based on n, units in the i” stratum for the auxiliary variable. 


It can easily be seen that 7, is an unbiased estimator of X: because x1 gives unbiased 


st 


estimates of the population mean Xyof auxiliary variable for the i” stratum. 


3. Suggested Family of Estimators 


Let us now consider the situation in which the study variable is subjected to non- 


response and the auxiliary variable is free from non-response. Motivated by Singh and 


Shukla (1987), we define the separate-type family of estimators of population mean Ki 


using factor-type estimators as 


Trs(ar)= 2) PT (@) (3.1) 
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wn nonce 
li li 


* 


, A=(a-I)(@-2), B=(a-Ia-4), C= (a—-2)a-3Xa—4); 
3.1 Particular Cases of T,,,(@) 
Case-1: If @=1 then A=B=0 , C=-6 


so that T,,(1)=T,, ae, 
Xi 


k ; 
and hence T,,. (1) = ops. —. 
i=l 
Thus, 7;,;(1)is the usual separate ratio estimator under non-response. 
Case-2: If @=2 then A=O=C , B=-2 


so that’ 7,,(2)=7,, — 


Xi 
Xi 


k 
and hence T7,,(2) = ply 
i=l 


which is the usual separate product estimator under non-response. 


Case-3: If @=3 then A=2 ,B=-2, C=0 


so that T,,(3)=T,, ie 
_ li 


k 
and hence T7,,,(3)= ae (3) 
i=l 
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(3.2) 


a>o. 


(3.3) 


(3.4) 


(3.5) 


which is the separate dual to ratio-type estimator under non-response. The dual to ratio 


type estimator was proposed by Srivenkataramana (1980). 
Case-4: If ~=4 then A=6 ,B=0, C=0 
so that 7,,(4)=T,, 


k 
and hence T,,,(4) = lh aT (3.6) 


i=l 
which is usual mean estimator defined in stratified population under non-response. 


3.2 Properties of 7,,,(@) 


Using large sample approximation, the bias of the estimator 7,,,(@), up to the first 


order of approximation was obtained following Singh and Shukla (1987) as 


- AalSn.Xo| | . Ci pa Cal | (3.7) 


i= n, WN, }_ A+ fB+C 
= S,. S,. . 
where (ga? C, ==, C,==+., S, and S? are the population 
A+ fB+C X 0 X ii 


h 


mean squares of study and auxiliary variables respectively in the i" stratum . ,),, is the 


population correlation coefficient between X,and X, in the i” stratum. The Mean 


Square Error (MSE) up to the first order of approximation was derived as 


MIT (a)|= Eles (a)-¥ of 
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~ 5 i Xe] MD ot He) (a) atti ; 


i=l X 0i Xi Xo; Xi 
Since v(r,,)= : Soi i, Wi Soin V(xw)= cau Si 
n, WN; n; n, WN, 
and Cov x1) = Ts Poi irr [ due to Singh (1998)]. 
Nn; i 


th 


where S;,, is the population mean square of the non-response group in the i” stratum 


and W,, is the non-response rate of the i“ stratum in the population. 


Therefore, we have 


El 1 
MPs (a)|= 54-2 ps [s2 - g(a) Rien =29(@)Roi Poi SoiS1i 
i=l i i 
eg Sa | 
+2 — WP i Son (3.8) 

zt 1, 

where Se oa 
Xi 


3.3 Optimum Choice of @ 


In order to obtain minimum MSE ofT,,(@), we differentiate the MSE with 


respect to @ and equate the derivative to zero 
“(1 1 2 ' 2 92 ‘ 
oF —_ TI Pi 29 (a )(a)Ro Si —2¢ (Q)Ro1: Poi SoS: = 0, (3.9) 


where ¢'(q) stands for first derivative of ¢(@).From the above expression, we have 
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jae 
—|p°R,..9,..8,.8.. 
N Jp ou Poo 1 


¢(a) = —— : F =V (say). (3.10) 
ate Lp Re 2 
N, Jp Oli Li 


It is easy to observe that ¢(@) is a cubic equation in the parameter @ . Therefore, 
the equation (3.10) will have at the most three real roots at which the MSE of the 


estimator T,,;(@) attains its minimum. 


Let the equation (3.10) yields solutions as@,, @, and a@, such that WM Ir 7 (a)| is 
same. A criterion of making a choice between@,, @, and @, is that “compute the bias of 
the estimator at @=a,, @, and @, and select @,, at which bias is the least”. This is a 


novel property of the FTE. 


3.4 Reducing MSE through Appropriate Choice of @ 


By using FTE for defining the separate-type estimators in this chapter, we have an 
advantage in terms of the reduction of the value of MSE of the estimator to a desired 
extent by an appropriate choice of the parameter @ even if the non-response rate is high 


in the population. The procedure is described below: 


Since MSE’s of the proposed strategies are functions of the unknown parameter 
@ as well as functions of non-response rates W,,, it is obvious that if @ is taken to be 
constant, MSE’s increase with increasing non-response rate, if other characteristics of the 
population remain unchanged, along with the ratio to be sub sampled in the non-response 
class, that is, L;. It is also true that more the non-response rate, greater would be the size 
of the non-response group in the sample and, therefore, in order to lowering down the 
MSE of the estimator, the size of sub sampled units should be increased so as to keep the 
value of L, in the vicinity of 1; but this would, in term, cost more because more effort 
and money would be required to obtain information on sub sampled units through 


personal interview method. Thus, increasing the size of the sub sampled units in order to 
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reduce the MSE is not a feasible solution if non-response rate is supposed to be large 
enough. 

The classical estimators such as Ty, Typ, Top, discussed earlier in literature in 
presence of non-response are not helpful in the reduction of MSE to a desired level. In all 
these estimators, the only controlling factor for lowering down the MSE isL,, if one 
desires so. 

By utilizing FTE in order to propose separate- type estimators in the present work, 


we are able to control the precision of the estimator to a desired level only by making an 


appropriate choice of @. 


Let the non-response rate and mean-square of the non-response group in the i” 


stratum at a time be W,, = or Tespectively. Then, for a choice of @=a,,, the 


i 


MSE of the estimator would be 
[ l= Af 1 1 2 2 Day 9s ed 
M T,5(@)/W,, =) a ONE DP; |Soi + ¢(@,) Roi Si: — 29(@ Roi: Poi SoiS1i 
i=l i i 


L.-1 
W2P; Soin (3.11) 


i 


k 
oa 
i=l 


Let us now suppose that the non-response rate increased over time and it is 


roe : : : 
W,,. = a such that V,, > N,,. Obviously, with change in non-response rate, only the 


i 


parameter S;,, will change. Let it becomes S,,,. Then we have 


AY J 
M{f;5(@) =— xi [s2 +0 (a@,)° Rio 20(a,, Row PoaS oS 


L-ly, 
2s n, WP Pi Soin (3.12) 
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Clearly, if @ =a@,and S;, > S>,, then MIT. (ar) W,, | = MIPes (ar) W,|. Therefore, 


we have to select a suitable valuea@,, such that even if W,,>W,, and Sy, >So.55 
expression (3.12) becomes equal to equation (3.11) that is, the MSE of T,,(a) is reduced 


to a desired level given by (3.11). Equating (3.11) to (3.12) and solving for d(a,), we get 


k 1 1 k 
pcr, ry[2- N. +p Roney 2¢( Qa, y{+- Li Roi PouS oii 


i=l 


Af 1 1 
Er Jprolos) Ri, 3-244 Ra Po SoS 


i 


| 
YA 2 S22 Wi) 0. (3.13) 


which is quadratic equation in (a, ). On solving the above equation, the roots are 


obtained as 


; 2 
B 1 1 2 k 1 1 
2, n, N, Pi Roi Po oS 1i 2, n, i Roi Pou Sori 
o(@,)= Ef] i + EG 4 
x(2 ae sa 2, n, N, PROS: 
es 
L.-1 a 2 


k 
Jp (a)? Rosi 2¢(a%, Roi Pov So Sv td ; D; (7,82, -W,,S2,) 


ist Nj 


. 1 1 2 p2 2 
—-— |p Ro S| 
x2 N Jp Oli~ li 


i 


(3.14) 


The above equation provides the value of @ on which one can obtain the precision 
to a desired level. Sometimes the roots given by the above equation may be imaginary. 


So, in order that the roots are real, the conditions on the value of a, are given by 
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Jp? RoseoiSes 


k 


>: i 


ist N; 


L,-1 


D; (7,82, a W,S2.) 


1 


2p2 2 
“Ro SS 
N lp Oli li 


L 


JpPoPoiSoS 


A L.-1 


> i 


2p2 2 
“Ri .SS 
N lp Oli li 


i 


Nie 


DP; (7,82, = W, S29) 


4, Empirical Study 


1 


N, 


L 


Jpreasi 


1 
N, 


L 


pe 
nN. 


(3.15) 


(3.16) 


In this section, therefore, we have illustrated the results, derived above, on the 


basis of some empirical data. For this purpose, a data set has been taken into 


consideration. Here the population is MU284 population available in Sarndal et. al. 


(1992, page 652, Appendix B). We have considered the population in the year 1985 as 


study variable and that in the year 1975 as auxiliary variable. There are 284 


municipalities which have been divided randomly in to four strata having sizes 73, 70, 97 


and 44. 


Table 1 shows the values of the parameters of the population under consideration 


for the four strata which are needed in computational procedure. 


Table 1: Parameters of the Population 


Stratum 
Stratum : Mean | Mean ; : F 

: Bue (x ) (x ) ise (s? ) Soi Si; Pou (s:., 
(i) (N.) 0i li 

1 73 40.85 39.56 | 6369.0999 | 6624.4398 | 79.8066 | 81.3907 | 0.999 | 618.8844 
2 70 27.83 27.57 | 1051.0725 | 1147.0111 | 32.4202 | 33.8676 | 0.998 | 240.9050 
3 97 25.79 | 25.44 | 2014.9651 | 2205.4021 | 44.8884 | 46.9617 | 0.999 | 265.5220 
4 44 20.64 | 20.36 | 538.4749 | 485.2655 | 23.2051 | 22.0287 | 0.997 83.6944 
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The value of Rp, = Xo /X1 comes out to be 1.0192. 


We fix the sample size to be 60. Then the allocation of samples to different strata 


under proportional and Neyman allocations are shown in the following table 


Table 2: Allocation of Sample 


Stratum Size of Samples under 
(i ) Proportional Allocation Neyman Allocation 
1 15 26 
2 15 10 
3 21 19 
4 9 5 


On the basis of the equation (3.10), we obtained the optimum values of @: 
Under Proportional Allocation 


o(ar)= 0.9491, a@,,,= (31.9975, 2.6128, 1.12) and 


Under Neyman Allocation 


G(c1)= 0.9527, @,,,= (34.1435, 2.6114, 1.1123). 


The following table depicts the values of the MSE’s of the estimators T,,,(a) for 


Qa,,,, @=1 and 4 under proportional and Neyman allocations. A comparison of MSE of 


opt ? 
T,< (ac) with a,, and @=I1with that at @=4reveals the fact that the utilization of 
auxiliary information at the estimation stage certainly improves the efficiency of the 


estimator as compared to the usual mean estimator 7,,,. 
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Table 3: MSE Comparison (L, =2, W,, =10% for all 7) 


MSE Allocation 
Proportional Neyman 
M{Z (2) oot 0.6264 0.6015 
MIT. (1) 0.7270 0.6705 
M{T,s(4)|<V[7,.,| Sn 28.6080 


We shall now illustrate how by an appropriate choice of @ , the MSE of the 
estimators T,,,(@) can be reduced to a desired level even if the non-response rate is 


increased. 


Let us take L, =2, W,, =0.1, W,, =0.3 and S\,,. ==(in) for all i 


0i2 
Under Proportional Allocation 


From the condition (3.15) and (3.16), we have conditions for real roots of ¢(a, ) 


as 


(a, ) > 1.1527 and (a, ) < 0.7454. 
Therefore, if we take @(@, ) = 1.20, then for this choice of ¢(@,), we get 


MTs (|W, | = 3.0712 and M[T,.(@)W;, | = 4.6818. 


Thus, there is about 52 percent increase in the MSE of the estimator if non- 


response rate is tripled. Now using (3.14), we get @(a, )=1.0957 and 0.8025. At this value 
of ¢(a,), M [7 ae (a) reduces to 3.0712 even if non-response rate is 30 percent. Thus a 


possible choice of @ may be made in order to reduce the MSE to a desired level. 
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Under Neyman Allocation 


Conditions for real roots of ¢(a,, ) 
(a, ) > 1.1746 and (a, ) < 0.7309. 
If (a, ) = 1.20 then we have 
W,, | = 4.0072. 


MIT.(@)W,, | = 2.4885 and MIT, (a) 


Further, we get from (3.14), 6(@, )=1.0620 and 0.8435, so that 


MIT, (|W, |=2.4885 for o(a,, )=1.0620. 


5. Conclusion 


We have suggested a general family of factor-type estimators for estimating the 
population mean in stratified random sampling under non-response using an auxiliary 
variable. The optimum property of the family has been discussed. It has also been 
discussed about the choice of appropriate estimator of the family in order to get a desired 
level of accuracy even if non-response is high. The Table 3 reveals that the optimum 
estimator of the suggested family has greater precision than separate ratio and sample 
mean estimators. Besides it, the reduction of MSE of the estimators T,,(@) to a desired 
extent by an appropriate choice of the parameter @ even if the non-response rate is high 


in the population, has also been illustrated. 
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